Towards a Phonetic Brazilian Portuguese Spell Checker
نویسندگان
چکیده
Spell checking is no longer considered a big challenge for natural language processing, at least regarding the task of correcting documents during edition. Nevertheless, without human interaction, it is necessary to automatically choose the word that will more likely correct the misspelled word. Also, there is a further difficulty for spell checking: new types of errors on the web material have emerged due to the increasing participation of general public, especially when expressing opinions, feelings and requests, which take many characteristics from the spoken language. This paper presents the first efforts towards a new Brazilian Portuguese (BP) spell checker to deal with the challenges that emerged in the automatic processing of a web corpus, including a new phonetic algorithm to specifically address spelling correction in BP. The speller proposed here is able to correct 16% more words than Aspell, in a web corpus composed of reviews of products.
منابع مشابه
A Novel Binary Spell Checker
In this paper we propose a simple, flexible and efficient hybrid spell checking methodology based upon phonetic matching, supervised learning and associative matching in the AURA neural system. We evaluate our approach against several benchmark spell-checking algorithms for recall accuracy. Our proposed hybrid methodology has the joint highest top 10 recall rate of the techniques evaluated. The...
متن کاملPhonetic based SoundEx & ShapeEx algorithm for Sindhi Spell Checker System
This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in developing Sindhi Spell Checker which has yet not been developed prior to this work. The compound textual forms and glyphs of Sindhi language presents a substantial challenge for developing Sindhi spell checker system and generating similar suggestion list for misspelled words. In order to implement ...
متن کاملWebJspell, an Online Morphological Analyser and Spell Checker
Webjspell is an Internet multipurpose tool for Portuguese morphological analysis and spell checking. It provides examples of phrases, frequencies, verbal conjugation tables, word suggestions, and Internet pages spell checking. This article describes Webjspell features, and results.
متن کاملA Comparison of Standard Spell Checking Algorithms and a Novel Binary Neural Approach
In this paper we propose a simple, flexible and efficient hybrid spell checking methodology based upon phonetic matching, supervised learning and associative matching in the AURA neural system. We integrate Hamming Distance and n-gram algorithms that have high recall for typing errors and a phonetic spell-checking algorithm in a single novel architecture. Our approach is suitable for any spell ...
متن کاملDesign and Implementation of Punjabi Spell Checker
Spellcheckers are the basic tools needed for word processing and document preparation. Designing a spell checker for Indian languages such as Punjabi poses many new challenges not found in English, which complicates the design of the spell checker. Punjabi language is far different from Western languages in phonetic properties and grammatical rules. Thus the existing algorithms and techniques t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014